Organic acid synthesis from c1 substrates

ABSTRACT

Presented herein are biocatalysts and methods for converting C1-containing materials to organic acids such as muconic acid or adipic acid.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. Provisional Application No.62/212,264, filed Aug. 31, 2015, the contents of which are incorporatedby reference in their entirety.

CONTRACTUAL ORIGIN

The United States Government has rights in this invention under ContractNo. DE-AC36-08GO28308 between the United States Department of Energy andAlliance for Sustainable Energy, LLC, the Manager and Operator of theNational Renewable Energy Laboratory.

REFERENCE TO SEQUENCE LISTING

This application contains a Sequence Listing submitted as an electronictext file entitled “15-92_ST25.txt,” having a size in bytes of 224 kband created on Aug. 22, 2016. Pursuant to 37 CFR § 1.52(e)(5), theinformation contained in the above electronic file is herebyincorporated by reference in its entirety.

BACKGROUND

Methane is a critical component of Earth's carbon cycle that contributes18% to the Earth's warming. It is the major constituent of natural gasand biogas, composing up to 80% of the latter. CH₄ is emitted from avariety of natural and anthropogenic sources. Human-related activities,such as fossil fuel production (e.g., underground coal mining, oil andgas production), agriculture (e.g., enteric fermentation in livestock,manure management, and rice cultivation), landfills, and municipalwastewater are major contributors of global CH₄ emission. AnthropogenicCH₄ emission accounts for more than 60% of the total CH₄ budget (≈300 tgyr⁻¹).

CH₄ is not only one of the major contributors for climate change, it isalso the primary target for near-term climate regulation. Strandednatural gas (SNG), oil-associated gas and almost all waste-derivedbiogas are not economically feasible sources of energy due to small sizeor remote location. Each year, up to 116 million tonnes ofoil-associated methane and 40 million tonnes of biogas (equivalent to30% of the total US transportation fuel) are flared, which representslost energy (five quadrillion BTU of fossil fuel energy), unnecessarygreenhouse gas emissions and dangerous air pollution. Strategies foreffectively converting CH₄ into valuable compounds offer promising newtechnologies for global warming stabilization and possibly reduction.Methane-rich biogas offers a renewable alternative to fossil natural gasas a feedstock and intermediate in bioprocesses, adding to our capacityfor biofuels and biobased products to supplement those available fromlignocellulosics or algae.

The foregoing examples of the related art and limitations relatedtherewith are intended to be illustrative and not exclusive. Otherlimitations of the related art will become apparent to those of skill inthe art upon a reading of the specification and a study of the drawings.

SUMMARY

The following embodiments and aspects thereof are described andillustrated in conjunction with systems, tools and methods that aremeant to be exemplary and illustrative, not limiting in scope. Invarious embodiments, one or more of the above-described problems havebeen reduced or eliminated, while other embodiments are directed toother improvements.

Provided herein are engineered cells that are able to convert a C1substrate to an organic acid and that comprise one or more exogenouslyadded genes encoding a 3-dehydroshikimate dehydratase, a protocatechuicacid decarboxylase, a catechol 1,2-dioxygenase, aroG^(fbr), trpE^(fbr),a phosphoenolpyruvate synthase, a transketolase, a phosphoketolase,and/or a RuBisCO polypetide.

In some embodiments, the engineered cells comprise AroZ from Klebsiellavariicola, AroY from Enterobacter cloacae, and/or CatA fromAcinetobacter.

In various embodiments, the phosphoketolase is PktA or PktB and/or theRuBisCO polypetide is CbbL, CbbS, CbbQ and/or CbbP.

In certain embodiments, the engineered cell is a C1 metabolizing cell,such as cells from the genera Methylomonas, Methylobacter,Methylococcus, Methylosinus, Methylocystis, Methylomicrobium,Methanomonas, Methylophilus, Methylobacillus, Methylobacterium,Hyphomicrobium, Xanthobacter, Bacillus, Paracoccus, Nocardia,Arthrobacter, Rhodopseudomonas, Candida, Yarrowia, Hansenula, Pichia,Torulopsis, Rhodotorula or Pseudomonas; and/or methanotrophic bacterialcells or a methylotrophic (methanol-utilizing) bacterial or yeast cells.

In some embodiments, the C1 metabolizing cell is a methanotroph ormethylotroph; is from an organism from the genus Methylobacterium,Methylomicrobium, or Methylococcus (such as Methylobacterium extorquens,Methylobacterium radiotolerans, Methylobacterium populi,Methylobacterium chloromethanicum, Methylobacterium nodulans,Methylomicrobium alkaliphilum or a combination thereof); or is M.alkaliphilum 20Z or M. buryatense.

In various embodiments, the C1 substrate is methane, methanol, carbondioxide, carbon monoxide, syngas, natural gas, or biogas; and/or theorganic acid is muconic acid or adipic acid.

Also provided are methods for producing an organic acid by culturingcells described herein with a C1 substrate and recovering the organicacid from the culture.

In addition to the exemplary aspects and embodiments described above,further aspects and embodiments will become apparent by reference to thedrawings and by study of the following descriptions.

BRIEF DESCRIPTION OF THE DRAWINGS

Exemplary embodiments are illustrated in referenced figures of thedrawings. It is intended that the embodiments and figures disclosedherein are to be considered illustrative rather than limiting.

FIG. 1 shows the nucleic acid sequence (A; SEQ ID NO:1) and amino acidsequence (B; SEQ ID NO:2) of 3-dehydroshikimate dehydratase fromPodospora anserina.

FIG. 2 shows the nucleic acid sequence (A; SEQ ID NO:11) and amino acidsequence (B; SEQ ID NO:12) of protocatechuic acid decarboxylase fromEnterobacter cloacae.

FIG. 3 shows the nucleic acid sequence (A; SEQ ID NO:15) and amino acidsequence (B; SEQ ID NO:16) of catechol 1,2-dioxygenase from Candidaalbicans.

FIG. 4 shows the nucleic acid sequence (A; SEQ ID NO:33) and amino acidsequence (B; SEQ ID NO:34) of aroG^(fbr) from E. coli codon optimizedfor expression in Methylomicrobium alcaliphilum.

FIG. 5 shows the nucleic acid sequence (A; SEQ ID NO:35) and amino acidsequence (B; SEQ ID NO:36) of trpE^(fbr) from E. coli codon optimizedfor expression in Methylomicrobium alcaliphilum.

FIG. 6 shows the nucleic acid sequence (A; SEQ ID NO:37) and amino acidsequence (B; SEQ ID NO:38) of phosphoenolpyruvate synthase fromMethylomicrobium alcaliphilum 20Z.

FIG. 7 shows the nucleic acid sequence (A; SEQ ID NO:39) and amino acidsequence (B; SEQ ID NO:40) of transketolase from Methylomicrobiumalcaliphilum 20Z.

FIG. 8 shows a synthetic pathway for muconate biosynthesis from methane.E4P, erythrose 4-phosphate; PEP, phosphoenolpyruvate; DAHP,3-deoxy-d-arabino-heptulosonate-7-phosphate; DHQ, 3-dehydroquinate; DHS,3-dehydroshikimate; (1), DAHP synthase; (2), DHQ synthase; (3), DHQdehydratase; (4), Shikimate dehydrogenase; (5), 3-dehydroshikimatedehydratase; (6), protocatechuate decarboxylase; (7), catechol1,2-dioxygenase.

FIG. 9 shows a diagram of a vector useful for constitutive expression ofgenes such as AroZ, AroY and CatA in Methylomicrobium alcaliphilum 20Z.

FIG. 10 shows gene expression levels of cells engineered to express3-dehydroshikimate dehydratase (AroZ), protocatechuic acid decarboxylase(AroY) and catechol 1,2-dioxygenase (CatA).

FIG. 11 shows a diagram of a vector for expression of genes for enhancedflux to muconic acid pathway precursors.

FIG. 12 shows a diagram of the pCAH01 inducible expression vector. Knr(kanamycin resistance), Ampr (ampicillin resistance), tetR(transcriptionally-fused tetracycline repressor), OriV/OriT (IncP-basedorigin of replication/transfer), trfA (OriV replication initiationprotein).

FIG. 13 shows growth (OD₆₀₀) and cis,cis-muconic acid (ccMA) productionfor Methylomicrobium alcaliphilum 20Z cells engineered express3-dehydroshikimate dehydratase (AroZ), protocatechuic acid decarboxylase(AroY) and catechol 1,2-dioxygenase (CatA).

FIG. 14 shows increased carbon conversion efficiency from biocatalystsoverexpressing the phosphoketolase enzyme (pktA and pktB).

FIG. 15 shows the Embden-Meyerhof-Parnas (EMP) pathway, thephosphoketolase pathway (PKT) and how each acts on fructose-6-phosphate(F6P).

FIG. 16 shows an exemplary plasmid for high, constitutive expression ofgenes (such as Pkt isoforms PktA and PktB) from the methanoldehydrogenase promoter (Pmxa).

FIG. 17 shows (A) relative Pkt expression in wild-type (WT) M.buryatense cells or cells expressing plasmids containing PktA or PktB asdetermined by real-time PCR analysis; (B) SDS-PAGE analysis ofwhole-cell lysates demonstrating increased Pkt expression by engineeredstrains; and (C) in vitro conversion of fructose-6-phosphate (F6P) toacetyl-phosphate (AcP) by whole-cell lysates from wild-type (WT) cellsor cells expressing plasmids containing PktA or PktB.

FIG. 18 shows that Pkt overexpression increases carbon conversionefficiency (CCE) by (A) real-time CH₄ consumption (off gas analysis) forwild-type M. buryatense cells (WT; upper line) and Pkt-expressing cells(pPKT; lower line); (B) cell growth (DCW, dry cell weight) as measuredover time in a 0.5 L gas bioreactor with continuous CH₄ feed (20% CH₄ inair); and (C) the ratio of biomass DCW to total CH₄ consumption after 96hours.

FIG. 19 shows an exemplary vector for heterologous expression of theautotrophic pathway. The cbbL, cbbS, and cbbQ genes encoding the subunitand regulatory proteins of ribulose-1,5-bisphosphatecarboxylase/oxygenase (Rubisco) and phosphoribulokinase-encoding cbbPfrom Methylococcus capsulatus are included in the vector.

DETAILED DESCRIPTION

Presented herein are methods and biocatalysts for biological conversionof C1 substrates such as methane, methanol, and carbon-dioxide (ormaterials containing these compounds such as biogas) into organic acidssuch as muconic acid or adipic acid. Methanotrophs do not nativelyproduce muconic acid, but disclosed herein are processes for alteringmethanotrophs to enable the production of organic acids such as muconicacid or adipic acid from C1 (e.g., methane) containing sources.

The cells of methanotrophic organisms may be modified to express one ormore exogenously added genes encoding enzymes that allow the cell toconvert C1 substrates such as methane to muconic acid or adipic acid.Exemplary enzymes include 3-dehydroshikimate dehydratase, protocatechuicacid decarboxylase and catechol 1,2-dioxygenase enzymes. Specificexamples include the enzymes 3-dehydroshikimate dehydratase (AroZ) fromPodospora anserina, protocatechuic acid decarboxylase (AroY) fromEnterobacter cloacae, and catechol 1,2-dioxygenase (CatA) from Candidaalbicans, the nucleic acid and amino acid sequences of which areprovided in FIG. 1-3. Functional homologs of these enzymes from otherspecies are also suitable for use in the present disclosure.

Additional suitable examples of 3-dehydroshikimate dehydratases includethose from Klebsiella variicola (SEQ ID NOS:3 and 4), Bacillusthuringiensis (SEQ ID NOS:5 and 6), Enterobacter aerogenes (SEQ ID NOS:7and 8), and Acinetobacter baylyi (SEQ ID NOS:9 and 10). An additionalexemplary protocatechuic acid decarboxylase is the AroY gene fromKlebsiella variicola (SEQ ID NOS:13 and 14). Additional suitableexamples of catechol 1,2-dioxygenases include the CatA gene fromPseudomonas putida (SEQ ID NOS:19 and 20) and the CatA gene fromAcinetobacter spp. (SEQ ID NOS:17 and 18) as well as a variant versionof this gene where the proline residue at position 76 is mutated to analanine.

The cells may also be engineered to express one or more exogenouslyadded genes encoding aroG^(fbr), trpE^(fbr), a phosphoenolpyruvatesynthase, or a transketolase. The designation “fbr” refers to feedbackresistant variants of the indicated enzymes. Suitable examples includearoG^(fbr) from E. coli, trpE^(fbr) from E. coli, phosphoenolpyruvatesynthase from Methylomicrobium alcaliphilum 20Z or transketolase fromMethylomicrobium alcaliphilum 20Z, the nucleic acid and amino acidsequences of which are provided in FIG. 4-7. Functional homologs ofthese enzymes from other species are also suitable for use in thepresent disclosure. All nucleic acid sequences may be codon optimizedfor expression in the host cell.

Muconic acid is a dicarboxylic acid that may be converted viahydrogenation into adipic acid with high yield and specificity.Currently, nylon-6,6 accounts for greater than 85% of global adipic acidproduction. The dicarboxylic functionality of adipic acid affords a widevariety of upgrading strategies including polymerization, lactonization,diolization, and ketonization. As such, adipic acid can be readilyconverted to other large-market, high-value molecules and fuelprecursors, such as plasticizers, lubricants, engineering resins,polyurethanes, and food gelatin. Additionally, muconic acid itself canbe upgraded to produce commodity chemicals beyond adipic acid, includingterepthalic and trimellitic acids. Conventional adipic acid productionis almost exclusively dependent on petroleum-derived feedstocks andaccounts for nearly 10% of global N₂O emissions. Thus, the production ofmuconic acid or adipic acid through a biocatalytic upgrading processfrom renewable methane sources offers a novel production route withsignificant economic and sustainability benefits. Furthermore, thediversity of products that can be made from adipic acid assures a largedemand for the heretofore poorly valorized biogas.

Microbial utilization of CH₄ (methanotrophy) can occur in both aerobicand anaerobic environments, but only aerobic methanotrophic bacteria(methanotrophs) have been isolated in pure culture. Methanotrophs usemethane monooxygenase for the first oxidation step that converts CH₄into methanol (CH₃OH), which is further oxidized to formaldehyde (CH₂O).Gammaproteobacteria, such as M. alcaliphilum 20Z, assimilateformaldehyde through the assimilatory ribulose monophosphate (RuMP)pathway. The first part of the pathway is the condensation of CH₂O withribulose-5-phosphate (Ru5P) to produce 3-hexulose-6-phosphate, which issubsequently isomerized to fructose-6-phosphate.

Muconic acid production naturally occurs during the catabolism ofaromatic compounds in a limited number of microorganisms. Production ofmuconic acid in various microbes has been achieved by exploitingnaturally occurring intermediary metabolic pathways involved in thedetoxification or catabolism of aromatic compounds such as toluene andbenzoate. Methanotrophs do not naturally possess a pathway to producemuconic acid, and are obligate C1-utilizers. As such, a novel metabolicroute to muconic acid biosynthesis must be employed in order to leveragethe innate capacity of methanotrophic biocatalysts.

Shunting naturally occurring metabolites toward muconic acid presents adirect approach for synthesizing this product from renewable feedstocks.The shikimate pathway is involved in the synthesis of aromatic aminoacids, and muconic acid can be synthesized in a three step process fromthe 3-dehydroshikimate (DHS) intermediate produced in this pathway (FIG.8). Alternatively, muconic acid can be generated by converting theanthranilate intermediate of tryptophan biosynthesis into catechol bythe anthranilate 1,2 dioxygenase.

Methylomicrobium spp. do not require exogenous amino acids for growth,indicating that the up-front metabolic machinery is already in place.Genomic analysis of M. alkaliphilum 20Z confirmed all genes of theshikimate and pentose phosphate (non-oxidative, PPP) pathways arepresent, verifying the ability to produce DHS. Further, the strain hashigh flux via PPP and a high pool of the key precursors for the proposedroute of muconate biosynthesis, highlighting the feasibility ofengineering muconate production in the organism.

Synthetic design of a three-step muconate biosynthesis pathway may beestablished and optimized in bacteria (e.g., P. putida) and yeast, usingaromatics and glucose as precursors for muconic acid biosynthesis,respectively. Alternatively, muconate biosynthesis may be connected tomethane precursors. In one specific example, a three-step syntheticpathway comprised of the enzymes 3-dehydroshikimate dehydratase fromPodospora anserina, protocatechuic acid decarboxylase from Enterobactercloacae, and catechol 1,2-dioxygenase from Candida albicans may beincorporated into M. alcaliphilum 20Z. The genes may be codon optimizedfor 20Z and can be used in the construction of both integrative andreplicative plasmids.

Suitable cells include those able to convert a C1 substrate to anorganic acid or a C1 metabolizing cell, including methanotrophicbacterial cells or methylotrophic (methanol-utilizing) bacterial oryeast cells. In certain embodiments, the cell is a methanotroph ormethylotroph.

Examples include cells from the genera Methylomonas, Methylobacter,Methylococcus, Methylosinus, Methylocystis, Methylomicrobium,Methanomonas, Methylophilus, Methylobacillus, Methylobacterium,Hyphomicrobium, Xanthobacter, Bacillus, Paracoccus, Nocardia,Arthrobacter, Rhodopseudomonas, Candida, Yarrowia, Hansenula, Pichia,Torulopsis, Rhodotorula or Pseudomonas. Specific examples include cellsfrom Methylobacterium extorquens, Methylobacterium radiotolerans,Methylobacterium populi, Methylobacterium chloromethanicum,Methylobacterium nodulans, Methylomicrobium buryatense, Methylomicrobiumalkaliphilum or a combination thereof. In some embodiments, the cell isM. alkaliphilum 20Z.

In various embodiments, the methanotroph is a Methylomonas sp. 16a (ATCCPTA 2402), Methylosinus trichosporium (NRRL B-1 1,196), Methylosinussporium (NRRL B-1 1,197), Methylocystis parvus (NRRL fil 1, 198),Methylomonas methanica (NRRL B-1 1,199), Methylomonas albus (NRRL fil 1,200), Methylobacter capsulatus (NRRL B-1 1,201), Methylobacteriumorganophilum (ATCC 27,886), Methylomonas sp. AJ-3670 (FERM P-2400),Methylocella silvestris, Methylacidiphilum infernorum, Methylibiumpetroleiphilum, or a combination thereof.

In certain embodiments, the organic acid may be muconic acid or adipicacid. The C1 substrate may be a C1 compound such as methane, methanol,carbon dioxide, or carbon monoxide; or a C1-containing material such assyngas, natural gas, or biogas; or materials comprising any of thesecompounds or materials. Biogas, such as that generated from anaerobicdigestion of waste streams, including wastewater derived fromconventional biorefineries, is one example of a versatile, renewableC1-containing material source. Currently, biogas is utilized for on-siteheat and energy production or sold into the national grid for minimalcompensation. However, microbial conversion of biogas to value-addedchemicals using the modified microorganisms disclosed herein offersimmense valorization potential.

These biocatalysts hold great promise for the capture and conversion ofmethane from anaerobic-digestion-derived biogas and other anthropogenicCH₄ sources. However, biogas contains more than just methane, with CO₂typically comprising about 40% of biogas streams. A biocatalyst capableof co-utilization and conversion of CO₂ and CH₄ allows for completeutilization of biogas streams and, in turn, enhanced carbon conversionefficiencies. Such a biocatalyst may also shift the landscape ofgreenhouse gas mitigation, capture, and conversion pursuits, providing anovel, photosynthesis-independent CO₂ biocatalyst.

The biocatalyst microorganisms described herein may be further modifiedto express or overexpress a ribulose 1,5 bisphosphatecarboxylase/oxygenase (RuBisCO) enzyme that can allow autotrophic growthin the presence an energy source such as methane, formate or hydrogen).Both homologous and heterologous overexpression of RuBisCO enzymes mayincrease CO₂ utilization in an array of methanotrophs, including thosethat do not encode genes that enable autotrophic growth.

Exemplary enzymes include the cbbL, cbbS and cbbQ genes encoding thesubunit and regulatory proteins of RuBisCO, along with thephosphoribulokinase-encoding cbbP. For example, the nucleotide and aminoacid sequences for cbbL, cbbS, cbbQ and cbbP from Methylococcuscapsulatus are depicted as SEQ ID NOS:25 and 26 (cbbL); SEQ ID NOS:27and 28 (cbbS); SEQ ID NOS:29 and 30 (cbbQ); and SEQ ID NOS:31 and 32(cbbP), respectively. Functional homologs of these enzymes from otherspecies are also suitable for use in the present disclosure.

Additional embodiments include biocatalysts engineered to express oroverexpress one or more genes encoding a phosphoketolase enzyme.Phosphoketolases catalyze the phosphorylytic cleavage ofxylulose-5-phosphate to acetyl-phosphate and glyceraldehyde-3-phosphateor fructose-6-phosphate to acetyl-phosphate and erythrose-4-phosphate.In many methanotrophs, methane is assimilated into fructose-6-phosphate.As depicted in FIG. 15, expression of phosphoketolase by microorganismssuch as methanotrophic bacteria may bypass pyruvate decarboxylation andconvert some or all of available fructose-6-phospate to intermediates ofthe non-oxidative pentose phosphate pathway and acetyl-phosphate. Such amodification may increase the carbon conversion efficiency inmethanotrophic biocatalysis.

Exemplary phosphoketolases include PktA and PktB from bacteria from thegenus Methylomicrobium. The nucleotide and amino acid sequences for PktAand PktB from M. buryatense, for example, are provided as SEQ ID NOS:21and 22 (PktA) and SEQ ID NOS:23 and 24 (PktB), respectively. Functionalhomologs of these enzymes from other species are also suitable for usein the present disclosure. Expression may be constitutive or may beinducible to allow a convenient means to switch between the conventionalEmbden-Meyerhof-Parnas (EMP) pathway and the phosphoketolase-mediatednon-oxidative pentose phosphate pathway (see FIG. 15).

Cells may be cultured using conventional techniques and media that willvary with the cell type. Methanotrophic cells can be cultured in eithermethanol or methane at a wide range of temperatures depending on thenature of the cells. Examples include temperatures ranging from 20° C.to 65° C., from 30° C. to 37° C., or temperatures greater than 37° C. orgreater than 45° C. Methanol may be added directly to the medium at0.1%-5%. Gases such as methane may be bubbled into the liquid mediumcontinuously, or added into the headspace of sealed vials. Gases may bemethane, methane/air mixtures or an array of biogas or natural gasstreams that may or may not be mixed with air or varying concentrationsof oxygen. For example, methanotrophic cells may be cultured in NMS2medium at 30° C. with orbital shaking at 175 rpm. Strains may be grownin sealed 1 L glass serum bottles with 25% methane in air, or 500 mLbaffled flasks supplemented with 1% methanol. Additional cultivationtechniques may also be suitable, including growth in solid media orlarge scale fermenters.

In order to facilitate regulated, heterologous gene expression inmethanotrophs, an inducible, broad-host range vector, pCAH01 (FIG. 12),was constructed by fusing the tetracycline promoter/operator (tetp/o orPtet) from pASK75 with the IncP-based origin of the pAWP78 vector thatcan be replicated by Methylomicrobium spp. Experiments using GFPfluorescence as a readout of promoter activity indicated tightlycontrolled tetp/o-mediated gene expression in M. buryatense afterinduction with sub-lethal concentrations of the anhydrotetracyclineinducer. The tetp/o did not show any leaky gene expression in theabsence of inducer, making it an excellent tool for conditional geneexpression/knock-out studies in methanotrophic bacteria that replicatevectors containing the IncP-based origin of replication.

In certain embodiments, a nucleic acid may be identical to the sequencerepresented herein. In other embodiments, the nucleic acids may be leastabout 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%,93%, 94%, 95%, 96%, 97%, 98% or 99% identical to a nucleic acid sequencepresented herein, or 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%,90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identical to anucleic acid sequence presented herein. Sequence identity calculationscan be performed using computer programs, hybridization methods, orcalculations. Exemplary computer program methods to determine identityand similarity between two sequences include, but are not limited to,the GCG program package, BLASTN, BLASTX, TBLASTX, and FASTA. The BLASTprograms are publicly available from NCBI and other sources. Forexample, nucleotide sequence identity can be determined by comparingquery sequences to sequences in publicly available sequence databases(NCBI) using the BLASTN2 algorithm.

The nucleic acid molecules exemplified herein encode polypeptides withamino acid sequences represented herein. In certain embodiments, thepolypeptides may be at least about 80%, 81%, 82%, 83%, 84%, 85%, 86%,87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99%identical to the reference amino acid sequence while possessing thefunction. The present disclosure encompasses cells that contain thenucleic acid molecules described herein, have genetic modifications tothe nucleic acid molecules, or express the polypeptides describedherein.

“Nucleic acid” or “polynucleotide” as used herein refers to purine- andpyrimidine-containing polymers of any length, either polyribonucleotidesor polydeoxyribonucleotide or mixed polyribo-polydeoxyribonucleotides.This includes single- and double-stranded molecules (i.e., DNA-DNA,DNA-RNA and RNA-RNA hybrids) as well as “protein nucleic acids” (PNA)formed by conjugating bases to an amino acid backbone. This alsoincludes nucleic acids containing modified bases.

Nucleic acids referred to herein as “isolated” are nucleic acids thathave been removed from their natural milieu or separated away from thenucleic acids of the genomic DNA or cellular RNA of their source oforigin (e.g., as it exists in cells or in a mixture of nucleic acidssuch as a library), and may have undergone further processing. Isolatednucleic acids include nucleic acids obtained by methods describedherein, similar methods or other suitable methods, including essentiallypure nucleic acids, nucleic acids produced by chemical synthesis, bycombinations of biological and chemical methods, and recombinant nucleicacids that are isolated. In certain embodiments, the nucleic acids arecomplementary DNA (cDNA) molecules.

The nucleic acids described herein may be used in methods for productionof organic acids through incorporation into cells, tissues, ororganisms. In some embodiments, a nucleic acid may be incorporated intoa vector for expression in suitable host cells. Alternatively,gene-targeting or gene-deletion vectors may also be used to disrupt orablate a gene. The vector may then be introduced into one or more hostcells by any method known in the art. One method to produce an encodedprotein includes transforming a host cell with one or more recombinantnucleic acids (such as expression vectors) to form a recombinant cell.The term “transformation” is generally used herein to refer to anymethod by which an exogenous nucleic acid molecule (i.e., a recombinantnucleic acid molecule) can be inserted into a cell, but can be usedinterchangeably with the term “transfection.”

Suitable vectors for gene expression may include (or may be derivedfrom) plasmid vectors that are well known in the art, such as thosecommonly available from commercial sources. Vectors can contain one ormore replication and inheritance systems for cloning or expression, oneor more markers for selection in the host, and one or more expressioncassettes. The inserted coding sequences can be synthesized by standardmethods, isolated from natural sources, or prepared as hybrids. Ligationof the coding sequences to transcriptional regulatory elements or toother amino acid encoding sequences can be carried out using establishedmethods. A large number of vectors, including algal, bacterial, yeast,and mammalian vectors, have been described for replication and/orexpression in various host cells or cell-free systems, and may be usedwith genes encoding the enzymes described herein for simple cloning orprotein expression.

Certain embodiments may employ promoters or regulatory operons. Theefficiency of expression may be enhanced by the inclusion of enhancersthat are appropriate for the particular cell system that is used, suchas those described in the literature. Suitable promoters also includeinducible promoters. Expression systems for constitutive expression incells include, for example, the vectors described in the figures.Inducible expression systems are also suitable for use.

Host cells can be transformed, transfected, or infected as appropriatewith gene-disrupting constructs or plasmids (e.g., an expressionplasmid) by any suitable method including electroporation, calciumchloride-, lithium chloride-, lithium acetate/polyethylene glycol-,calcium phosphate-, DEAE-dextran-, liposome-mediated DNA uptake,spheroplasting, injection, microinjection, microprojectile bombardment,phage infection, viral infection, or other established methods.Alternatively, vectors containing a nucleic acid of interest can betranscribed in vitro, and the resulting RNA introduced into the hostcell by well-known methods, for example, by injection. Exemplaryembodiments include a host cell or population of cells expressing one ormore nucleic acid molecules or expression vectors described herein (forexample, a genetically modified microorganism). The cells into whichnucleic acids have been introduced as described above also include theprogeny of such cells.

Vectors may be introduced into host cells by direct transformation, inwhich DNA is mixed with the cells and taken up without any additionalmanipulation, by conjugation, electroporation, or other means known inthe art. Expression vectors may be expressed by host cells episomally orthe gene of interest may be inserted into the chromosome of the hostcell to produce cells that stably express the gene with or without theneed for selective pressure. For example, expression cassettes may betargeted to neutral chromosomal sites by double recombination.

Host cells with targeted gene disruptions or carrying an expressionvector (i.e., transformants or clones) may be selected using markersdepending on the mode of the vector construction. The marker may be onthe same or a different DNA molecule. In prokaryotic hosts, thetransformant may be selected, for example, by resistance to ampicillin,tetracycline or other antibiotics. Production of a particular productbased on temperature sensitivity may also serve as an appropriatemarker.

In exemplary embodiments, the host cell may be a microbial cell, such asa bacterial cell or a yeast cell, and may be from any genera or speciesof microorganism that is known to consume C1 substrates and isgenetically manipulable. Exemplary microorganisms include, but are notlimited to, bacteria; fungi; archaea; protists; eukaryotes, such asalgae; and animals such as plankton, planarian, and amoeba.

Host cells may be cultured in an appropriate fermentation medium. Anappropriate, or effective, fermentation medium refers to any medium inwhich a host cell, including a genetically modified microorganism, whencultured, is capable of growing and producing products such as organicacids. Such a medium is typically an aqueous medium comprisingassimilable carbon, nitrogen and phosphate sources, but can also includeappropriate salts, minerals, metals and other nutrients. Microorganismsand other cells can be cultured in conventional fermentation bioreactorsor photobioreactors and by any fermentation process, including batch,fed-batch, cell recycle, and continuous fermentation. The pH of thefermentation medium is regulated to a pH suitable for growth of theparticular organism. Culture media and conditions for various host cellsare known in the art. A wide range of media for culturing cells, forexample, are available from ATCC.

Isolation or extraction of organic acids from the cells may be aided bymechanical processes such as crushing, for example, with an expeller orpress, by supercritical fluid extraction, pH-induced precipitation, orthe like. Once the organic acids have been released from the cells, theycan be recovered or separated from a slurry of debris material (such ascellular residue, by-products, etc.). This can be done, for example,using techniques such as sedimentation or centrifugation. Recoveredorganic acids can be collected and directed to a conversion process ifdesired.

Processes for producing an organic acid may also comprise a removingstep, wherein a portion of the organic acid is removed from the mixtureto form substantially a pure organic acid. In some embodiments, theremoving step may comprise a two-step process, wherein the first stepcomprises an adorption process wherein the organic acid is not adsorbed,and other components are selectively adsorbed. In some embodiments ofthe present invention, a first step comprising adorption will result ina substantially pure organic acid stream.

The first step may be followed by a second step polishing step. Thesecond step may comprise crystallization. In still further embodimentsof the present invention, the removing step is at least one of affinitychromatography, ion exchange chromatography, solvent extraction,liquid-liquid extraction, distillation, filtration, centrifugation,electrophoresis, hydrophobic interaction chromatography, gel filtrationchromatography, reverse phase chromatography, chromatofocusing,differential solubilization, preparative disc-gel electrophoresis,isoelectric focusing, HPLC, reversed-phase HPLC, or countercurrentdistribution, or combinations thereof. In some embodiments of thepresent invention, the removing step may be performed either during themixing step or subsequent to the mixing step or both.

Separation/purification operations may be used to generate an organicacid free of or substantially free of a wide variety of impurities thatmay be introduced during the biological production of an organic acid.These impurities may include fermentation salts, nutrients and media tosupport growth, unconverted substrate, extracellular proteins and lysedcell contents, as well as the buildup of non-target metabolites.Accumulation of these constituents in culture broth may vary greatlydepending on the microorganism, substrate used for conversion,biological growth conditions and bioreactor design, and brothpretreatment.

Cell removal from the broth may be achieved by a variety of solidremoval unit operations. Some examples include filtration,centrifugation, and combinations thereof. Once the microorganism cellmatter has been removed, further impurity removal operations may beutilized such as exposing the mixture to activated carbon.

EXAMPLES Example 1 Plasmid and Strain Construction

Plasmids for heterologous gene expression were constructed using 2×Gibson Assembly Mix from New England Biolabs (Ipswich, Mass.) followingthe manufacturer's protocol. Polymerase chain reactions were performedusing Q5 High-Fidelity Polymerase from New England Biolabs and primersfrom Integrated DNA Technologies (Coralville, Iowa). Final constructswere confirmed by sequence analysis (Genewiz, South Plainfield, N.J.).Plasmid constructs were transformed into M. alcaliphilum viaconjugation. Positive transformants selected on P agar containing 100μg/mL of kanamycin were confirmed using plasmid-specific primers inpolymerase chain reactions. Escherichia coli Zymo 5a (Zymo Research,Irvine, Calif.) was used for cloning and plasmid propagation, and E.coli S17-1 λpir was used as the conjugation donor strain. E. colistrains were grown at 37° C. in Luria-Bertani (LB) broth supplementedwith 50 ug/mL of kanamycin.

Cultivation, Growth Parameters, and Bioreactor Fermentations

Methylomicrobium alcaliphilum 20Z were routinely cultured in NMS2 medium(3% NaCl) at 30° C. with orbital shaking at 225 rpm. Strains were grownin sealed 150 mL glass serum bottles (Kimble Chase, Vineland, N.J.) with20% (v/v) CH₄ in air, or 500 mL baffled flasks supplemented with 1%CH₃OH (v/v).

Batch CH₄ fermentations were performed in a 0.5 L or 5.0 L BioFlo batchbioreactor (New Brunswick Scientific, Edison, N.J.) containing NMS2medium supplemented with 8× KNO3, 2× phosphate buffer, and 4× traceelement solution to support high cell growth. The temperature wasmaintained at 30° C., and mixing was achieved by using a bottom marineimpeller and mid-height Rushton impeller at 500 rpm. A continuous flowrate of 300 ccm (0.5 L) or 3000 ccm (5.0 L) 20% (v/v) methane in air wasmaintained. Antifoam was added as needed.

Off-Gas Analysis and Detection of Organic Acids

CH₄ and CO₂ off-gas was measured every 20 minutes for the duration ofbioreactor fermentations by using infrared-based BlueSens gas detectorsspecific for CH₄ and CO₂ (Herten, Germany). CH₄ consumption wasdetermined by calculating the difference between off-gas detected beforeand after inoculation. Percentage CH₄ consumption was converted toweight based on CH₄ density and the flow rate (56.64 g CH₄/24 hours, 0.5L reactor; 566.4 g CH₄/24 hours, 5.0 L reactor).

High pressure liquid chromatography (HPLC) was used to detect excretedproducts in culture supernatants. At the indicated time, the OD₆₀₀ wasmeasured and a 1 mL sample was taken for HPLC analysis. Culturesupernatant was filtered using a 0.2 μm syringe filter or 0.5 mL 10KMWCO centrifuge tube (Life Technologies) and then separated using amodel 1260 HPLC (Agilent, Santa Clara, Calif.) and a Rhenomonex RezexRFQ-Fast Fruit H+ column (Bio-Rad). A 0.1 mL injection volume was usedin 0.01 N sulfuric acid with a 1 mL/minute flow rate at 85° C.Refractive index and diode array detectors were used for compounddetection, which were confirmed to have matching UV spectral profiles aspure compounds. Concentrations were calculated by regression analysiscompared to known standards. FIG. 13 shows the production levels ofcis,cis-muconic acid from a culture of Methylomicrobium alcaliphilum 20Zengineered to express the AroZ and AroY genes from K. variicola alongwith the CatA gene from Acinetobacter spp. (P76A variant).

Example 2 Expression of the Autotrophic Pathway

The cbbL, cbbS, and cbbQ genes encoding the subunit and regulatoryproteins of ribulose-1,5-bisphosphate carboxylase/oxygenase (Rubisco)and the phosphoribulokinase-encoding cbbP gene from Methylococcuscapsulatus were cloned into the inducible expression vector pCAH01 tocreate pRubisco (FIG. 19).

The pRubisco vector has been transformed into Methylomicrobiumalcaliphilum cells via biparental mating and confirmed via PCR analysis.Heterologous expression of Rubisco and phosphoribulokinase was confirmedby SDS-PAGE analysis.

The Examples discussed above are provided for purposes of illustrationand are not intended to be limiting. Still other embodiments andmodifications are also contemplated.

While a number of exemplary aspects and embodiments have been discussedabove, those of skill in the art will recognize certain modifications,permutations, additions and sub combinations thereof. It is thereforeintended that the following appended claims and claims hereafterintroduced are interpreted to include all such modifications,permutations, additions and sub-combinations as are within their truespirit and scope.

1. An engineered cell, comprising an exogenously added gene encoding a3-dehydroshikimate dehydratase (AroZ), a protocatechuic aciddecarboxylase (AroY), a catechol 1,2-dioxygenase (CatA), aphspho-2-dehydro-3-deoxyheptonate aldolase (AroG), an anthranilatesynthase (TrpE), a phosphoenolpyruvate synthase, or a transketolase;wherein the engineered cell is able to convert a C1 substrate to anorganic acid.
 2. The engineered cell of claim 1, wherein the cellcomprises genes encoding a 3-dehydroshikimate dehydratase (AroZ), aprotocatechuic acid decarboxylase (AroY), and a catechol 1,2-dioxygenase(CatA).
 3. The engineered cell of claim 2, wherein the3-dehydroshikimate dehydratase (AroZ) is from Klebsiella variicola. 4.The engineered cell of claim 2, wherein the protocatechuic aciddecarboxylase (AroY) is from Enterobacter cloacae.
 5. The engineeredcell of claim 2, wherein the catechol 1,2-dioxygenase (CatA) is fromAcinetobacter.
 6. The engineered cell of claim 2, wherein the3-dehydroshikimate dehydratase (AroZ) is from Klebsiella variicola, theprotocatechuic acid decarboxylase (AroY) is from Enterobacter cloacae,and the catechol 1,2-dioxygenase (CatA) is from Acinetobacter.
 7. Theengineered cell of claim 1, wherein the cell further comprises anexogenously added gene encoding a phosphoketolase.
 8. The engineeredcell of claim 7, wherein the phosphoketolase is PktA or PktB.
 9. Theengineered cell of claim 1, wherein the cell further comprises anexogenously added gene encoding a CbbL, CbbS, CbbQ or CbbP polypeptide.10. The engineered cell of claim 9, wherein the cell further comprisesexogenously added genes encoding CbbL, CbbS, CbbQ and CbbP polypeptides.11. The engineered cell of claim 1, wherein the engineered cell is a C1metabolizing cell.
 12. The engineered cell of claim 11, wherein the C1metabolizing cell is selected from the genera Methylococcus.
 13. Theengineered cell of claim 11, wherein the C1 metabolizing cell is amethanotrophic bacterial cell or a methylotrophic (methanol-utilizing)bacterial or yeast cell.
 14. The engineered cell of claim 11, whereinthe C1 metabolizing cell is a methanotroph or methylotroph.
 15. Theengineered cell of claim 11, wherein the C1 metabolizing cell is from anorganism from the genus Methylococcus.
 16. The engineered cell of claim11, wherein the C1 metabolizing cell is Methylococcus capsulatus. 17.The engineered cell of claim 11, wherein the C1 metabolizing cell is M.alkaliphilum 20Z or M. buryatense.
 18. The engineered cell of claim 1,wherein the C1 substrate is methane, methanol, carbon dioxide, carbonmonoxide, syngas, natural gas, or biogas.
 19. The engineered cell ofclaim 1, wherein the organic acid is muconic acid or adipic acid.
 20. Amethod for producing an organic acid, comprising: a) culturing theengineered cell of claim 1 with a C1 substrate; and b) recovering theorganic acid from the culture.